Fast Inverse Reinforcement Learning with Interval Consistent Graph for Driving Behavior Prediction

نویسندگان

  • Masamichi Shimosaka
  • Junichi Sato
  • Kazuhito Takenaka
  • Kentarou Hitomi
چکیده

Maximum entropy inverse reinforcement learning (MaxEnt IRL) is an effective approach for learning the underlying rewards of demonstrated human behavior, while it is intractable in high-dimensional state space due to the exponential growth of calculation cost. In recent years, a few works on approximating MaxEnt IRL in large state spaces by graphs provide successful results, however, types of state space models are quite limited. In this work, we extend them to more generic large state space models with graphs where time interval consistency of Markov decision processes are guaranteed. We validate our proposed method in the context of driving behavior prediction. Experimental results using actual driving data confirm the superiority of our algorithm in both prediction performance and computational cost over other existing

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inverse Reinforcement Learning through Structured Classification

This paper adresses the inverse reinforcement learning (IRL) problem, that is inferring a reward for which a demonstrated expert behavior is optimal. We introduce a new algorithm, SCIRL, whose principle is to use the so-called feature expectation of the expert as the parameterization of the score function of a multiclass classifier. This approach produces a reward function for which the expert ...

متن کامل

Graph Based Inverse Reinforcement Learning for Social Robot Navigation

Abstract. Mobile robots that operate in human populated spaces are required to act in ways perceived as socially normative with respect to the environment considered. Past approaches have merely focussed on robot centric optimality criteria such as path lengths, heading changes, time to goal etc while ignoring social aspects like personal space intrusions. We learn how to navigate socially from...

متن کامل

Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling

Recent advances in the field of inverse reinforcement learning (IRL) have yielded sophisticated frameworks which relax the original modeling assumption that the behavior of an observed agent reflects only a single intention. Instead, the demonstration data is typically divided into parts, to account for the fact that different trajectories may correspond to different intentions, e.g., because t...

متن کامل

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated negotiation skills with other road users when overtaking, giving way, merging, taking left and right turns and while pushing ahead in unstructured urban roadways. Since there are many possible scenarios, manually tackling all possible cases will likely yield a too simplistic policy. Moreover, one must ba...

متن کامل

Backward inhibitory learning in honeybees: a behavioral analysis of reinforcement processing.

One class of theoretical accounts of associative learning suggests that reinforcers are processed according to learning rules that minimize the predictive error between the expected strength of future reinforcement and its actual strength. The omission of reinforcement in a situation where it is expected leads to inhibitory learning of stimuli indicative for such a violation of the prediction. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017